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(54) A method and a device for recognising speech 



(57) According to the invention, speech recognition 
can be limited to a smaller number making use of com- 
mands (16, 17) given by a user By means of a method 
and device, according to the invention, it is also possible 
to activate a speech recognition device by making use, 
in the activation, of an existing keyboard or a touch-sen- 
sitive screen of the device. A method, according to the 
inventbn, provides the user with a togical way to activate 
the speech recognition device in the device simultane- 
ously providing an improved capacity of the speech rec- 
ognition dev'ice. According to the invention, it is also pos- 



sible to carry out the speech recognition process by 
making use of commands given by the user irrespective 
of how the speech recognition device itself is activated. 
According to an embodiment of the invention, the device 
comprises a touch-sensitive screen or surface, in which 
case the information about a single tetter or several let- 
ters written on the screen is transmitted to the speech 
recognition device, whereupon speech recognition is 
limited to words, wherein the letters in questton occur. 
Speech recognition is most preferably limited to a name 
beginning with the letter written on the touch screen by 
the user. 
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Description 

[0001] Tlie present invention relates to a method for 
recognising speech and a device that utilises the speech 
recognition method according to the invention. 
[0002] Nonmally. in mobile telephones, it is possible 
to browse through a telephone notepad to select a name 
by making use of the first letter of the name searched 
for. In this case, when a user during the search presses, 
e.g. the letter V, the names beginning with the letter 
"s" are retrieved from a memory. Thus, the user can 
more quickly find the name he/she is looking for without 
needing to browse through the content of the notepad 
in alphabetical order in order to find the name. This kind 
of method is fully manual and is based on the commands 
given by the user through a keyboard and the browsing 
of a memory based on this. 

[0003] Today, there are also some mobile stations that 
utilise speech recognition devices, wherein a user can 
give a comrr^nd by voice. In these devices, the speech 
recognition device is often speaker-dependent; i.e. the 
operatkxi of the speech recognition device is based on 
that the user teaches the speech recognition device 
words that the speech recognition device is supposed 
to later recognise. There are also so-called speaker-in- 
dependent speech reoognitk>n devices for which no 
separate training phase is required. In this case, the op- 
eration of the speech recognitkxi device is based on a 
large amount of teaching material compiled from a large 
sampling of different types of speakers. Moderate oper- 
ation in case of a so-calted average user is typical of a 
speaker-independent recognition device. Correspond- 
ingly, a speaker-dependent speech recognition device 
operates best for the person who has trained the speech 
recognition device. 

[0004] It is typical of both speech recognitkDn devices 
mentbned above that the performance of the speech 
recognition device greatly depends on how large a vo- 
cabulary is used. It is also typical of speech recognition 
devices according to prior art that they are limited to a 
specific number of words, which the speech recognition 
device is capable of recognising. For example, in mobile 
stations, a user is provided with a maximum of 20 
names, which he/she can store in a notepad within the 
telephone by voice and. correspondingly, use these 
stored names in connection with voice selection. It is 
quite obvbus that such a number is not sufficient in 
present or future applicatbns, where the objective is to 
substantially increase the number of words to be recog- 
nised. As the number of words to be recognised increas- 
es, e.g. ten-fold, with current methods. It is not possible 
to maintain the same speech recognition capacity as 
when using a smaller vocabulary. Another limiting factor, 
e.g. In terminal equipment, is the need for a memory to 
be used, which naturally increases as the vocabulary of 
the speech recognition device expands. 
[0005] In current speech recognition devices accord- 
ing to prior art, the activation of a speech recognition 



device can be implemented by voice using a specific ac- 
tivation command, such as e.g. "ACTIVATE", whereup- 
on the speech recognition device is activated and is 
ready to receive commands from a user. A speech rec- 

s ognltion device can also be activated with a separate 
key. It is typical of speech recognitkm devices activated 
by voice that the performance of the activation is de- 
pendent on the noise level of the surroundings. Also dur- 
ing the operation of the speech recognition device, the 

10 noise level of the surroundings greatly affects the per- 
formance of the speech recognition device to be 
achieved. It can be said that critical parameters for the 
performance of a speech recognition device are the ex- 
tent of the vocabulary and the noise conditions of the 

IS surroundings. 

[0006] A further known speech recognition system is 
disclosed in US 4,866.778 where a user can select a 
sub-vocabulary of words by selecting an initial string of 
one or more letters causing the recognition to be per- 

20 formed against the sub-vocabulary restricted to words 
starling with. those initial letters. 
[0007] Now, we have invented a method and a device 
for recognising speech the objective of which Is to avoid 
or, at least, to mitigate the above-mentioned problems 

2S of prior art. The present inventk>n relates to a devk;e and 
a method, wherein a user is allowed to give, during 
speech recognition, a qualifier by means of which 
speech recognition is only limited to those speech mod- 
els that correspond with the qualifier provided by the us- 

30 er In this case, only a specific sub-set to be used during 
speech recognition is selected from the prestored 
speech models. 

[0008] According to an embodiment of the invention, 
a speech recognition device is activated at the same 

35 time as a qualifier that limits speech recognitbn is pro- 
vided by touching the device making use of the existing 
keyboard or touch-sensitive screen/base of the device. 
The activation is most preferably implemented with a 
key. A method according to the inventbn provides a user 

40 with a logical way to activate the speech recognitton de- 
vice therein at the same time providing an Improved per- 
formance of the speech recognition device along with 
the entered qualifier. The limitation of speech recogni- 
tion according to the invention can also be Implemented 

45 apart from the activation of the speech recognition de- 
vice. 

[0009] According to an exemplary embodiment of the 
invention, the device comprises a touch-sensitive 
screen or surface (base), whereupon the Information 

so about the character or several characters written on the 
screen is transmitted to the speech recognition device, 
in which case speech recognition is limited to words 
wherein the characters in question occur Speech rec- 
ognition is most preferably limited to a name beginning 

ss with the character written by the user on the touch 
screen. 

[001 0] According to an exemplary embodiment of the 
invention, speech recognition can also be implemented 
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by making use tn advance of all the stored models and 
by utilising the limiting qualifier provided by the user 
when defining the final recognition result. 
[0011] According to a first aspect of the invention 
there is provided a method for recognising an utterance s 
of a user with a device, wherein a set of models of the 
utterances have been stored in advance and for speech 
recognition, the utterance of the user is received, the 
utterance of the user is compared with the prestored 
models and, on the basis of the comparison, a recogni- 
tion decision is made, the method being characterised 
in that. 



comparison made by the comparison means to said set 
of nrKxlels and means for storing a menu structure of a 
device and for identifying the received qualifier as an 
item in a menu structure of the device. 

Figure 1 shows the structure of a speech recognition . 

device, according to prior art. as a block di- 
agram, 

Figure 2 shows the structure of a speech recognition 
device, according to the invention, as a 
block diagram, 
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- the user is allowed to provide a qualifier limiting the 
comparison by touching the device, the qualifier 
itentifying an item In a menu structure of the devkse. 

- a sub-set of models is selected from the stored 
models on the basis of the qualifier provided by the 
user said sub-set of nrtodels kientifying sub-items of 
the menu structure, and 

a comparison is made for making the recognition 
decisbn by comparing the utterance of the user with 
said sub-set of models. 

[001 2] According to a second aspect of the invention 
there is provided a method for recognising an utterance 
of a user with a device, wherein a set of models of the 
utterances have been stored in advance and for speech 
recognition, the utterance of the user is received, the 
utterance of the user is compared with the prestored 
models and, on the basis of the comparison, a recogni- 
tion decision is made, the method being characterised 
in that, 

- a comparison is made for making a first recognition 
decisbn by comparing the utterance of the user with 
the prestored models. 

the user is allowed to provide a qualifier limiting the 
comparison by touching the devbe for selecting a 
sub-set of models, the qualifier itentifying an item in 
a menu structure of the device and said sub-set of 
models identifies sub-items of the menu stnjcture, 
a final comparison is made for nnaking the recogni- 
tton decision by comparing the first recognitbn de- 
clsk)n with said sub-set of models. 

[0013] According to a third aspect of the invention 
there is provided a device comprising a speech recog- 
nitbn device for recognising the utterance of a user, 
memory means for storing speech nrKxiels, and means 
for receiving the utterance of the user, comparison 
means for carrying out the recognition process by com- 
paring the utterance of the user with the models stored 
in the memory means, the device being characterised 
in that the device also comprises means for receiving a 
qualifier from the user by touching the device, means 
for selecting a set from the stored nrKxJels on the basis 
of the qualifier received from the user for limiting the 



Figure 3 shows the operation of a method, accord- 
T5 ing to the invention, as a flowchart. 

Figure 4 shows the operatbn of another method, ac- 
cording to the Inventksn, as a fk>wchart, and 

20 Figure 5 shows the structure of a mobile station uti- 
lising a method according to the inventk^n. 

[0014] Figure 1 shows the block diagram structure of 
a known speech recognitk>n device as applicable to the 

2S present inventton. Typfcally. the operation of the speech 
recognition devtee is divbed into two different main ac- 
tivities: an actual speech recognitbn phase 10-12, 
14-15 and a speech training phase 10-13. as shown in 
Figure 1. The speech recognitk^n device receives from 

30 a microphone as its input a speech signal S(n). whbh is 
converted into a digital form by an A/D converter 10 us- 
ing, e.g. a sampling frequency of 8 kHz and a 12-bit res- 
olution per sample. Typically, the speech recognition de- 
vice comprises a so-called front -end 11, wherein the 

35 speech signal is analysed and a feature vector 12 is 
modelled, the feature vector describing the speech sig- 
nal during a specific perk>d. The feature vector is de- 
fined, e.g. at 10 ms intenrals. The feature vector can be 
modelled using several different technk^ues. For exann- 

40 pie, different kinds of techniques for modelling a feature 
vector have been presented in the reference: J. Picone. 
■Signal modeling techniques in speech recognition", 
IEEE Proceedings, Vol. 81, No. 9, pp. 1215r1247. Sep- 
tember 1993. During the training phase, models are 

45 constructed by means of the feature vector 1 2. in a train- 
ing block 13 of the speech recognition device, for the 
words used by the speech recognitk>n device. In model 
training 1 3a, a nrKXlel is defined for the word to be rec- 
ognised. In the training phase, a repetitkxi of the word 

so to be nrK>delled can be utilised. The models are stored 
in a memory 1 3b. During speech recognitton, the feature 
vector 12 is transmitted to an actual recognition devtee 
14, which compares, in a block 15a, the models con- 
structed during the training phase with the feature vec- 

55 tors to be constructed of the speech to be recognised, 
and the decision on the recognition result is made in a 
block 15b. The recognition result 15 denotes the word, 
stored in the memory of the speech recognition device. 
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that best corresponds with the word uttered by a person 
using the speech recognition device. 
[001 5] Figure 2 shows the operation of a speech rec- 
ognltton device according to the invention where, in ad- 
dition to the solution according to Figure 1, the speech 
recognition device comprises a block 16, wherein the 
selection of the models is carried out on the basis of the 
commands given by a user, e.g. through a keyboard. 
The block 16 receives as its input a signal 17 containing 
the information on which key the user has pressed. In 
the block 16. speech models 18. transmitted by the 
block 13b, are compared with the signal 17 and a sub- 
set 19 is selected from these and transmitted to the 
btock 15a of the speech recognition devce. The selec- 
tion of the models relating to the operation of the block 
16 has been described bek)w making use of a memory 
structure according to the present inventbn. 

Table 1 



Name 


Number 


Reference Model 


Smith Paul 


0405459883 


XXX... X 








Table 2 
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IS 



20 



2S 



Menu 








Phone Settings 






Messages 








Read messages 






Write messages 




Memory Functions 





3S 



[0016] Table 1 shows a menrK>ry structure according 
to the invention, which may form. e.g. a phone notepad 
of a mobile station or part of it. The merinory comprises 
the name of a person, a telephone number that corre- 
sponds with the name, as well as a reference model (e. 
g. a feature vector) constructed during the speech rec- 
ognition training phase. The table shows as an example 
one line of the table, on which a person's name 'Smith*, 
a corresponding telephone number '0405459883' . as 
well as a data field containing a reference model "xxx... 
x" are stored. The length of the reference model is a 
speech recognition devrce-specific parameter and, 
therefore, the field length depends on the speech rec- 
ognitkxi device used. According to the invention, when 
a user presses a specific key of the device, e.g. the key 
"s*. the processor of the device goes through the content 
of the memory and compares the content of the data 
field containing the name and only retrieves from the 
memory the names that begin with the letter "s". The 
comparison can be made, e.g. by comparing the ASCII 
character of the pressed key with the ASCII character 
of the first letter of the name in the memory and by se- 
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so 



lectin g the reference model that corresponds with the 
name provided that the letters correspond with one an- 
other in the comparison. The information on the selected 
reference models (sub-set) is then transmitted to the 
speech recognitbn device, after which the speech rec- 
ognition devrce carries out speech recognition using the 
models that relate to the names selected above. 
[001 7] The user can further also press another key, e. 
g. the key *m', whereupon speech recognition is limited 
to the names beginning with the letter combination 
"Sm". In this case, the number of names to be recog- 
nised can be further limited, i.e. the sub-set of models 
decreases. In addition, it is also possible that the mem- 
ory contains fields other than the above-mentioned 
name field on the basis of which the speech recognition 
devtee is activated according to the invention. The tele- 
phone memory of a device, such as a mobile statnn, 
may contain, e.g. a fiekJ that indicates whether a specific 
number is a mobile station number or not. In this case, 
the memory field may contain, e.g. an element 'GSM', 
whereupon when the user activates this field only the 
GSM numbers are selected and not the others, e.g. the 
numbers of a fixed network or fax numbers. Thus, the 
invention is not limited to a case wherein the letter se- 
lected by the user controls the operation of the speech 
recognitkxi devk:e but, instead, the user may select 
names, e.g. from a telephone notepad according to 
some other classification. For example, the names in a 
telephone notepad may have been divktod into classes 
like •Home", 'Office', 'Friends", in whteh case the mo- 
bile station may prcvkJe a convenient way to select from 
the menu, e.g. the class "Friends", whereupon speech 
recognition is directed to the names in this class, ac- 
cording to the invention. It is also possible that the mo- 
bile station comprises a keyboard, wherein several dif- 
ferent characters are combined in a specific key. For ex- 
ample, the letter symbols "J, k, I' may be included in the 
numeric key "5'. In this case, the invention can be ap- 
plied so that when the user presses the key "5", the 
speech recognitbn device is activated so that, in speech 
recognitkMi. it is limited to names beginning with the let- 
ters "j", 'k' or T. In an exemplary embodiment of the 
invention, when the user presses the key SEND speech 
recognitbn, according to the invention, can be limited, 
e.g. to the last made calls (e.g. the last 10 calls). In this 
case, a call can be commenced, e.g. by pressing and 
holding the key SEND, while the user simultaneously 
pronounces the name to be recognised, whereupon 
speech recognition is limited to a set of models contain- 
ing the name/symbol of the last 10 calls. 
[0018] The speech recognition devbe is most prefer- 
ably activated by a press-and-hotd. whereupon the de- 
vice (speech recognition device) is informed by the 
pressing and holding of the key in question that you want 
speech recognition to commence. At the same time, the 
information about the pressed key is transmitted to the 
speech recognition device, i.e. speech recognition is 
limited, e.g. to words beginning with the letter on the key, 
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whereupon only the reference models that the user 
wants are activated. It is also according to the invention 
that the speech recognition device is activated in a way 
other than by pressing a key, tor example, by voice. !n 
this case, after the activation of the speech recognition s 
device, it is possible to use, during speech recognition, 
the reference nrKxJel selection according to the Invention 
as presented above. 

[0019] An arrangement, according to the invention, 
can also be made for the menu structure of a mobile io 
station, as shown in Table 2, Table 2 shows a specific 
part of the menu structure of a telephone. In this exam- 
ple, the main menu consists of the menus "Telephone 
Settings', "Messages' and 'Memory Functions'. Corre- 
spondingly, the menu "Messages' consists of the sub- is 
menus "Read messages" and "Write messages". When 
a user of the telephone selects a menu function by voice 
or pressing a menu key, activation is limited to the points 
in the menu. In the example, activation by vofee is di- 
rected to the menus "Telephone Settings", Messages" ^ 
or "Memory Functions". The user can further manually 
select the submenu "Messages", in which case activa- 
tion by voice is directed to the points 'Read messages' 
or "Write messages" of the menu in question. The 
above-mentioned method can also be applied to exte- 2S 
mal services for a mobile station and their activation. In 
this case, a specific key of the mobile station is defined 
for a specific sendee, e.g. tor a WWW service (WorW 
Wide Web). In this case, the pressing and hoWing of the 
key in question enables, e.g. the selection of a book- so 
mark of WWW addresses by using a voice command. 
In this application, the mobile statton contains a table of 
letter symbols, which are selected as described above. 
[0020] Figure 3 shows the activity sequence of a 
method according to the invention. In Phase 30, it is dis- 3S 
covered whether or not a user has carried out the press- 
and-hold that activates the speech recognition device. 
If no press-and-hold is discovered, the devne remains 
in a state where the activatkxi of the speech recognition 
device is being anticipated. Altematively. the speech 40 
recognition device can be activated at the same time, 
when the user starts writing on a touch-sensitive sur- 
face, such as a screen. The activatk^n of the speech rec- 
ognition device may also be based on voice. In Phase 
31 , the letter/text written on the touch screen is recog- 4S 
nised. In Phase 32, the information about the pressing 
of the key is transmitted to the speech recognition de- 
vice and/or the information about the alphanumeric 
character written or drawn by the user on the touch 
screen is transmitted. It is also possible to draw, on the so 
touch screen, some other figure that deviates from an 
alphanumeric character, which is utilised in speech rec- 
ognition. In Phase 33, it is examined whether or not the 
user is still carrying out the pressing of keys or writing 
on the touch screen, in which case the information about ss 
these activities is also transmitted to the speech recog- 
nition device. This can be done by comparing the activ- 
ities of the user with a specific time threshold value by 
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means of which it is decided whether or not the user has 
concluded the giving of commands. In Phase 34, the 
word pronounced by the user is recognised by making 
use of the information provided in Phase 32. 
[0021 ] Figure 4 shows another activity sequence of a 
method according to the invention. In this method, the 
pronounced word is first recognised traditionally and. 
only after this, the limitation provided by the user is uti- 
lised to limit the result obtained during the recognition 
phase. In Figure 4, Phases 30-33 correspond with the 
corresponding phases in Figure 3. In Phase 35, the ut- 
terance of the user is recognised by rrraking use of all 
the prestored models. The information about this recog- 
nition result is transmitted to Phase 34, wherein the final 
recognition decision is made by comparing the first rec- 
ognition decisran with said sub-set of nrradels, which has 
been obtained on the basis of the limrtatkxi provided by 
the user. The recognitkxi decision obtained from Phase 
35 contains a set of proposed words that have been rec- 
ognised and the recognitk>n probabilities corresponding 
with the words, whfeh are transmitted to Phase 34. In 
case of a faulty recognition, the word that has got the 
highest recognition probability is not the word pro- 
nounced by the user. In this case, in Phase 34 according 
to the invention, it is possible to carry out the final 
speech recognition phase by means of the qualifier pro- 
vided by the user and reach a higher speech recognition 
performance according to the inventbn. A method ac- 
cording to the invention may also operate so that the 
giving of a limitatton and the recognition of a pronounced 
word are essentially simultaneous activities. 
[0022] Figure 5 shows the stmcture of a mobile sta- 
tion, which has a speech recognitton device 66 that uti- 
lises the present invention. The mobile station compris- 
es parts typical of the device, such as a microphone 61 , 
a keyboard 62. a screen 63, a speaker 64. and a control 
block 65, which controls the operation of the mobile sta- 
tion. According to an embodiment of the invention, the 
screen 63 may be a touch-sensitive surface, such as a 
screen. In addition, the figure illustrates transmitter and 
receiver blocks 67. 68 typical of a mobile station. The 
control block 65 also controls the operation of the 
speech recognition device 66 in connection with the mo- 
bile station. When the speech recognition device is be- 
ing activated either during the speech recognitton de- 
vice's training phase or during the actual speech recog- 
nition phase, the voice commands given by the user are 
transmitted, controlled by the control block, from the mi- 
crophone 61 to the speech recognition device 66. Ac- 
cording to the invention, the control block 65 transmits 
to the speech recognition device 66 the information 
about the commands given by the user through keys or 
about the alphanumeric character/figure entered on to 
the touch screen. The voice commands can also be 
transmitted through a separate HF (hands free) micro- 
phone. The speech recognition device is typically imple- 
mented by means of DSP and it comprises external and/ 
or internal ROM/RAM circuits 69 necessary for its oper- 
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ation. 

[0023] An embodiment of the present invention may 
comprise a device, e.g. a mobile station, which has a 
touch-sensitive surface, such as a touch-sensitive 
screen or base, in this case, a user writes the first letter 
of the word to be recognised on the touch-sensitive sur- 
face, e.g. with a pen or draws with a finger and simulta- 
neously pronounces the word to be recognised (alter- 
natively, the user presses the point of the letter dis- 
played on the screen). In this case, the information 
about the provided letter is transmitted to the speech 
recognition device and speech recognition is limited to 
words in which the tetter in question occurs. Recognition 
is most preferably limited to words that begin with the 
initial in question as described above. In this case, the 
user may write, according to the invention, on the touch- 
sensitive surface, e.g. the letter 'S' and simultaneously 
pronounce the name to be recognised, e.g. 'Smith', 
whereupon speech recognition is limited to names be- 
ginning with the letter "S*. 

[0024] Alternatively, the user may first write a letter on 
the touch screen and, after this, pronounce the word to 
be recognised. The atx5ve-mentioned method based on 
keys and writing on a touch-sensrtive surface can also 
be combined, in which case the user can both write on 
a touch-sensitive surface and press some key and uti- 
lise both of these data in speech recognition. The touch- 
sensitive surface In itself is outside this invention, and it 
can be implemented in vark^us ways according to prior 
art. 

[0025] It can be estinnated that with a method, accord- 
ing to the present invention, a recognition accuracy, 
which is 10-30-fold compared with recognitbn devices 
according to prior art can be achieved if the number of 
names to be recognised remains the same. On the other 
hand, by means of the invention. It is possible to recog- 
nise according to the invention 10-30 times as many 
names can be recognised while the recognition accura- 
cy remains unchanged. This improved capacity is based 
on a combination, according to the inventbn, whereup- 
on commands given by the user through keys/a touch- 
sensitive surface, i.e. qualifiers limitoig speech recogni- 
tion search, are combined with speech recognition. One 
exemplary embodiment of the invention was based on 
the utilisation of a touch screen. An advantage of this 
application is that the algorithms used in text recognition 
and speech recognitbn are almost identical, whereupon 
the amount of program memory required does not in- 
crease much in a device, where both of these functions 
are implemented. 

[0026] Above we described a mobile station as an ex- 
emplary embodiment of the present inventkin. However, 
the invention can equally well be applied, for example, 
to computers. The present inventbn is not limited to the 
embodiments presented above, and it can be modified 
within the framework of the enclosed claims. 
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Claims 

1 . A method for recognising an utterance of a user v/ith 
a device, wherein a set of models of the utterances 

s have been stored in advance and for speech rec- 
ognition, the utterance of the user is received, the 
utterance of the user is compared with the prestored 
models and, on the basis of the comparison, a rec- 
ognition decisksn Is made, characterised in that. 

10 in the method, 

the user is allowed to provide a qualifier limiting 
the comparison by touching the devce, the 
qualifier itentifying an item in a menu structure 

IS of the device. 

a sub-set of models is selected from the stored 
models on the basis of the qualifier provided by 
the user said sub-set of nxxiels identifying sub- 
items of the menu structure, and 

20 - a comparison is made for making the recogni- 
tion decision by comparing the litterance of the 
user with said sub-set of models. 

2. A method for recognising an utterance of a user with 
25 a device, wherein a set of models of the utterances 

have been stored In advance and for speech rec- 
ognition, the utterance of the user is received, the 
utterance of the user is compared with the prestored 
nfKxfels and, on the basis of the comparison, a rec- 
30 ognrtion decisbn is made, characterised in that, 
in the method, 

a comparison is made for making a first recog- 
nition decision by comparing the utterance of 

3S the user with the prestored models, 

the user is albwed to provide a qualifier limiting 
the comparison by touching the device for se- 
lecting a sub-set of models, the qualifier itenti- 
fying an item in a menu structure of the devbe 

40 and said sub-set of models identifies sub-items 

of the menu structure, 
- a final comparison is made for making the rec- 
ognition decisbn by comparing the first recog- 
n'ltkm decisbn with said sub-set of models. 

45 

3. A method according to claim 1 or 2, characterised 
in that a speech recognition device Is activated in 
response to the qualifier provided by the user. 

50 4. A method according to claim 1 or 2. characterised 
in that the user is allowed to give said qualifier by 
pressing a key. 

5. A method according to claim 1 or 2. characterised 
55 in that the user is albwed to provide said qualifier 
by writing an alphanumeric character on a touch- 
sensith/e surface of the device. 
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6. A method according to claim 3 or 4, characterised 
in that the user is allowed to provide said qualifier 
as a press-and-hold. 

7. A device comprising a speech recognition device 
(66) for recognising the utterance of a user, memory 
means (69) for storing (1 3b) speech models, and 
means (61 ) for receiving the utterance of the user, 
comparison means (19. 15a, 15b) for carrying out 
the recognition process by comparing the utterance 
of the user with the models stored in the memory 
means, characterised in that the device also com- 
prises means (62. 63) for receiving a qualifier (17) 
from the user by touching the device, means (16) 
for selecting a set from the stored models on the 
basis of the qualifier received from the user for lim- 
iting the comparison made by the comparison 
means (19, 15a, 15b) to said set of models and 
means (65) for storing a menu structure of a device 
and for identifying the received qualifier as an item 
in a menu structure of the device. 

8. A device according to claim 7, characterised in that • 
the means for receiving the qualifier from the user 
comprise a keyboard. 

9. A device according to claim 7, characterised in that 
the means for receiving the qualifier comprise a 
touch-sensitive surface. 

30 

10. A device according to claim 7, characterised in that 
it comprises means (62. 63, 65) for activating the 
speech recognition device in response to the qual- 
ifier received from the user. 
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